Compression of binary documents using pattern recognition
نویسنده
چکیده
To preserve all the information in analog documents when they are digitized , high resolution scanning of them has become common. When the ana-log document has a large physical size, which is frequently the case, the resulting digital image occupies a large storage space. Hence, methods that compress document images substantially without loss of signi-cant information are of great practical importance. In this paper we describe a compression method for binary documents based on pattern recognition of symbols and straight lines. Symbols recognized by statistical classication are saved using the ASCII values and the positions of the symbols in the document. Straight lines are detected by a modied Hough transform which also recognizes the end points of each line. Detected lines are represented by the end points. Unlabeled objects , the residue, are saved by run-length coding the bitmap. The best achieved compression ratio was several times the compression ratio achieved with run-length, Lempel-Ziv and CCITT's MRC-II, at a resolution of 500 ppi.
منابع مشابه
Authorship analysis based on data compression
6 This paper proposes to perform authorship analysis using the Fast Compression Distance (FCD), a similarity measure based on compression with dictionaries directly extracted from the written texts. The FCD computes a similarity between two documents through an effective binary search on the intersection set between the two related dictionaries. In the reported experiments the proposed method i...
متن کاملModelling of Eyeball with Pan/Tilt Mechanism and Intelligent Face Recognition Using Local Binary Pattern Operator
This paper describes the vision system for a humanoid robot, which includes the mechanism that controls eyeball orientation and blinking process. Along with the mechanism designed, the orientation of the camera, integrated with controlling servomotors. This vision system is a bio-mimic, which is designed to match the size of human eye. This prototype runs face recognition and identifies, match...
متن کاملLocal gradient pattern - A novel feature representation for facial expression recognition
Many researchers adopt Local Binary Pattern for pattern analysis. However, the long histogram created by Local Binary Pattern is not suitable for large-scale facial database. This paper presents a simple facial pattern descriptor for facial expression recognition. Local pattern is computed based on local gradient flow from one side to another side through the center pixel in a 3x3 pixels region...
متن کاملProposing an effective approach for Network security and multimedia documents classically using encryption and watermarking
Local binary pattern (LBP) operators, which measure the local contrast within a pixel's neighborhood, successfully applied to texture analysis, visual inspection, and image retrieval. In this paper, we recommend a semi blind and informed watermarking approach. The watermark has been built from the original image using Weber Law. The approach aims is to present a high robustness and imperceptibi...
متن کاملClassification of emotional speech using spectral pattern features
Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...
متن کامل